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METHOD AND COMPUTER FOR DATA SET SEPARATION 

FIELD OF THE INVENTION 

This invention relates to a method and system for allocation of storage 
for data sets in a manner that minimizes single points of failure and improves 
access performance. 



BACKGROUND OF THE INVENTION 

Computer systems are known to have a host computer and a storage 
y 10 system. Data sets associated with applications that run on the host computer 
y are stored in the storage system. The storage system generally includes a 

H plurality of storage devices that are controlled by a plurality of controllers that 

i!| may each have one or more logical subsystems. A failure that occurs in a 

W component, such as a controller or a logical subsystem, impacts all data sets 

t 15 stored therein. Such a failure, sometimes known as a single point of failure, 
m can be problematic if all the data sets of a client are stored in the failed 

0 component. Performance of simultaneous data access may also be 

II problematic if the accessed data is stored on the same storage component. 

pis 
! 

20 The known computer systems generally allocate data sets for storage in 

the components of the storage system. For example, U.S. Patent No. 
5,790,886 describes an automated data storage space allocation system that 
uses prioritized parameters, such as available space, cache, performance, 
size, availability, location, portability and share status. To determine available 

25 storage devices, these data set parameters are compared with storage system 
characteristics, such as available space, cache, performance, portability, 
volatility, location, cost and fragmentation. The comparison is used to 
establish a linked chain of available storage devices in a preference ordered 
sequence. However, the automated procedure of this system does not 

30 address the problem of a single point of failure impacting all data sets 

allocated to the device in which the failure occurs. 
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Thus, a need exists for an improved data set allocation that is capable 
of data allocation in which storage locations for a data set can be allocated so 
as to separate the data set from other data sets. 

5 

SUMMARY OF THE INVENTION 

The method and computer of the present invention allocates data sets 
among storage volumes of a storage system according to requirements and/or 
preferences that separate critical data sets from one another so as to minimize 

10 the impact of a single point of failure that occurs in one part of the storage 
system. The method allocates a first data set by identifying at least one 
second data set from which the first data set is to be separated. Any of the 
storage resources of the storage system that contains a storage volume that 
stores the second data set are also identified. An eligible volume list is 

1 5 formed according to a policy for selection of a storage volume for the first data 
set. Any storage volume of the storage resource identified as containing the 
second data set is excluded from the eligible volume list. This assures that a 
failure in either a storage resource that is selected for the first data set or the 
storage resource that contains the second data set has a minimal impact on 

20 the other thereof. 

According to one aspect of the method of the present invention, the 
remainder of the storage volumes of the resources of the storage system are 
ordered according to the policy to prefer those that meet a preferred 
25 separation level ahead of those that do not meet the preferred separation 
level. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Other and further objects, advantages and features of the present 
30 invention will be understood by reference to the following specification in 



2 



SJO920010052US1 



10 



conjunction with the accompanying drawings, in which like reference 
characters denote like elements of structure and: 

Fig. 1 is a block diagram of a computer system of the present invention; 

Fig. 2 depicts storage allocations for the computer system of Fig. 1 ; 

Fig. 3 is a flow diagram of the data separation program of the Fig. 1 
computer system; and 

Fig. 4 depicts exemplary separation policies. 



i DESCRIPTION OF THE PREFERRED EMBODIMENT 

Referring to Fig. 1, a data processing system 20 includes a host 
1 5 computer 22, a storage system 24 and an administrative client 26. Host 

computer 22 includes a processor 28, an interface 30, a memory 32 and a bus 
34. Bus 34 interconnects processor 28, interface 30 and memory 32. Memory 
32 includes an operating system 36, a utilities program 38 and an application 
program 40. Operating system 36 includes a storage management program 
20 42 and a data set separation program 44. Storage management system 42 
may be any suitable system that manages storage of data sets in a data 
processing system, such as the Storage Management Sub-system (SMS) of 
the IBM® Corporation. Data set separation program 44 according to the 
present invention provides the capability of allocating different data sets to 
25 different storage components so as to minimize the impact of a failure in one 
of the components, such as a controller or a logical subsystem thereof. 

Storage system 24 may comprise any suitable storage system, such as 

disk drives, semiconductor memory, optical memory, and the like. For 

30 example, storage system 24 may comprise a plurality of storage disks with 

one or more controllers that communicate with host computer via interface 30. 
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The controller may have one or more logical subsystems that each have one 
or more storage volumes. 

Administrative client 26 is any suitable input device that is capable of 
5 operation by an administrative person (administrator) who inputs processing 
rules into host computer 22 for use by operating system 36, utilities 38 and 
application program 40. 

Data processing system 20 is capable of being interconnected with a 
u 1 0 memory media , such as a disk, 46 that contains the software code for 
St operating system 36, utilities 38 and application 40. That is, the software 

H contained on memory media 46 can be loaded into memory 32. 

ry 
Ut 

E{ Operating system 36 is operable to cause processor 28 to execute 

PI 

* 15 application 40. Storage management 42 is operable to allocate logical storage 

p% • ■ 

SI volumes for use by applications, such as application 42. Data set separation 

Ef program 44 is operable to permit data set separation allocation as an 

O extension to storage management system 42 or as a separate program entity. 



20 The administrator using administrator client 26 supplies one or more 

policy rules that are used by storage management program 42 to allocate 
storage volumes of storage system 24 among a plurality of data sets. Data set 
separation program 44 permits the administrator to extend storage allocation 
to allocate storage volumes to a data set based on a data set separation 

25 policy. 



Referring to fig. 2, an exemplary hierarchy 50 is shown for storage 

system 24. Hierarchy 50 includes two controllers, designated as controller 1 

and controller 2. Controller 1 includes two logical subsystems, designated as 

30 logical subsystem 1 and logical subsystem 2. Logical subsystem 1 includes 

two storage volumes, designated as volume 1 and volume 2. Logical 
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subsystem 2 includes two storage volumes, designated as volume 3 and 
volume 4. Controller 2 includes two logical subsystems, designated as logical 
subsystem 3 and logical subsystem 4. Logical subsystem 3 includes two 
storage volumes, designated as volume 5 and volume 6. Logical subsystem 4 
5 includes two storage volumes, designated as volume 7 and volume 8. It will 
be apparent to those skilled in the art that hierarchy 50 may include more or 
less than two controllers with each having more or less than two logical 
subsystems that each have more or less than two storage volumes. 



i'W 
fl 
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10 Controllers 1 and 2, logical subsystems 1-4 and the storage volumes 

thereof are sometimes referred to herein generically as storage resources. 
For example, a storage resource can be controller 1 , logical subsystem 1 or 
W logical subsystem 2. Each storage resource includes one or more storage 

jy volumes. For example, controller 1 includes volumes 1-4 and logical 

w 15 subsystem 1 includes volume 1 and volume 2. In a broader sense, shared 
p resources for which the data separation method and computer of the present 

invention apply may include any appropriate units, such as drive read/write 
heads, volumes, logical units (LUNs) controllers, logical subsystems, physical 
control units, data channels and paths, data directors and switches, host 
20 systems and the like. 

Data that is to be assigned for storage in a particular storage resource 
may be any form of data, such as a data set, record, field, object, data types, 
meta data, table, database, logs, files, file system and the like. The term "data 
25 set" is used herein in a generic sense to mean any of the data forms. 

Data set separation program 44 affords the capability for the 
administrator to separate data sets among separate storage resources. This 
provides the advantages of minimizing contention for resources and 
30 minimizing impact of single points of failure. For example, the administrator 
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can separate mission critical data sets from one another so as to minimize the 
impact of failure of a storage component. 

Referring to Figs. 2 and 4, two data separation policies A1 and B1 are 
5 provided as input to data set separation program 44. Separation policy A1 
lists two data sets CHECKING.ACCT and SAVINGS.ACCT A2 that require a 
separation A3 from each other on the logical subsystem 1 level. The term 
level represents a hierarchy of storage resources 50 where a given storage 
resource, such as logical subsystem 1 , may have a plurality of sub level 
10 storage resources, such as volumes 1 and 2. Separation policy B1 lists two 
5 data sets CHECKING.ACCT and CHECKING. BACKUP B2, which require a 

5j separation B3 from each other on the logical subsystem 1 level and prefers a 

N separation on the controller 1 level. 

U1 . 
M 

: *' 15 A separation policy may specify two or more data sets, which are to be 

0 separated. A data set may be specified in one or more separation policies, 

3 such as the CHECKING.ACCT data set, which is listed in separation policies 

m 

A1 and B1. Utilizing multiple separation policies in this example allows 

W 

HI' SAVINGS.ACCT data A2 to be placed on the same storage resource as 

20 CHECKING.ACCT data B2. 



Data sets that are to be listed in separation policies may be manually 
decided by the storage administrator or dynamically generated by the 
operating system 36, utilities 38 or application program 40 based on data 
25 attributes or historical information. How the separation policy is created is not 
a part of the present invention. 

Data set separation program 44 identifies data sets that are to be 

separated from one another. For example, a new data set is to be separated 

30 from one or more currently stored data sets. Referring again to Fig. 2, 

program 44 identifies shared storage resources within hierarchy 50. For 
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example, volumes 1 and 2 share logical subsystem 1 , volumes 3 and 4 share 
logical subsystem 2 and logical subsystems 1 and 2 share controller 1 . 
Program 44 identifies the storage location of a currently stored data as volume 
3. This is indicated by the symbol "##" in Fig. 2. 

5 

Data set separation program 44 then maps required and preferred 
levels of separation to the storage resources. The separation levels may be 
provided by the administrator. For example, the administrator identifies logical 
subsystem as a required separation level and controller as a preferred 
10 separation level. The mapping operation rejects or excludes any storage 
Q resource that does not meet the required separation level. In the example, 

hj storage volume 3 resides in logical subsystem 2. Therefore, logical 

subsystem 2 and its storage volumes 3 and 4 are excluded from the allocation 

PI consideration. The remaining storage resources, volumes 1 and 2 and 

fi J 

s 15 volumes 5-8, are then ranked in an eligible volume list 52 so as to prefer 
|| volumes that meet the preferred separation level ahead of those that do not. 

!,.,.£ 

11*1: 

q Volumes 5-8 meet the preferred separation level (controller) and 

m volumes 1 and 2 do not. Thus, volumes 5-8 have the highest ranking, 

20 designated by the numeral 1, and volumes 1 and 2 have a lower ranking, 
designated by the numeral 2. 



The allocation procedure then attempts to place the new data set on 
volumes 5-8. If unsuccessful, allocation is then attempted on volumes 1 and 
25 2. If unsuccessful, the allocation by data set separation fails for lack of ability 
to separate based on the set criterion and allocation is by storage 
management program 42 continues as if there were no separation criterion. 

Referring to Fig. 3, data set separation program 44 determines at step 

30 60 if the data being allocated is listed in a data separation policy, which for 

example is provided by the administrator. If not, allocation of data by storage 
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management program 42 is attempted at step 62. That is, storage 
management program 42 will allocate data for storage according to its policy 
rules. If step 60 determines that the data being allocated is listed in the policy, 
step 64 finds the storage location of the currently stored data from which 
5 separation is required. Step 66 determines if other data locations exist. If not, 
step 62 is performed. If so, step 68 determines if there is a required level of 
separation. If not, step 70 is performed to determine if there is a preferred 
level of separation. If step 68 determines that there is a required separation 
level, the volumes of the shared resource level are removed or excluded from 
10 eligible volume list 52. In the example of Fig. 2, the currently stored data is in 
volume 3. The shared resource is logical subsystem 2. The shared volumes 
3 and 4 are excluded. 

At step 70, it is determined if a preferred data separation level is 
1 5 specified. If not, step 76 selects the next eligible volume on eligible volume list 
52. For example, there is no preferred separation level and all volumes, 
except for those excluded by step 72, are unranked for the data separation 
allocation. If step 70 determines that there is a preferred data separation 
level, at step 74 the volumes that share the same resource are moved to the 
20 bottom of eligible volume list 52. In the example of Fig. 2, volumes 1 and 2 
share controller 1 with volume 3 and are moved to the bottom (rank 2) of 
eligible volume list 52. 

At step 76, the next eligible volume on eligible volume list 52 is 

25 selected. Step 78 determines if the end of eligible volume list 52 is reached. If 

not, step 80 determines if space is successfully allocated to the selected 

volume. If so, a normal return is performed at step 82. If not (e.g., the 

selected volume has no free space), step 76 is repeated until step 80 

determines that space is successfully allocated or step 78 determines that the 

30 eligible volumes have been exhausted. That is, all eligible volumes have 

already been considered and are unavailable. This results in a failed 
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allocation by data set separation at step 84 with an abnormal return to storage 
management program 42. 

The present invention having been thus described with particular 
5 reference to the preferred forms thereof, it will be obvious that various 

changes and modifications may be made therein without departing from the 
spirit and scope of the present invention as defined in the appended claims. 
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